e 8.19 shows a new RPN chromosome for the RPN chromosome
the upper panel of Figure 8.13. Figure 8.20 shows the resulting
he training of a GP model
ning process is implemented in an iterated process. To start a GP
process, a pool of randomised RPN chromosomes are generated.
ach iteration of a learning process, the first job is to measure the
f each RPN chromosome to a given data set. The fitness
ment is composed of two parts. They are the model accuracy and
h of an RPN chromosome. The former is used to rank RPN
omes in terms of their fitness measurements to a data set. The
sed to award more parsimonious RPN chromosomes.
ose a problem is a regression question. The target variable or the
nt variable of a regression model is denoted by y and the output of
described by an RPN chromosome is denoted by ݕො. The sum of
ed errors between them is defined as below, where N is the total
of the observed target values, ݕ and ݕො stand for the ith observed
ue and the ith predicted value from a model defined by an RPN
ome,
ߝൌ1
ܰሺݕො
ே
െݕሻଶ
ୀଵ
(8.5)
ose the length of an RPN chromosome is denoted by ℓ. The
easure is defined as below, where m is the mth model described
N chromosome, 0 ൏ߙ൏1 stands for the trade-off constant, ߝ
or the sum of the squared errors measured for the mth RPN
ome,
ߴൌߙൈߝሺ1 െߙሻൈℓ
(8.6)
wo terms of the above fitness measurements need to be as small
ble. The best model is determined by the following equation,